AD-A261...  REPORT DOCUMENTATION PAGE


Form Approved
OMB No. 0704-0188


1. AGENCY USE ONLY (Leave blank)    2. REPORT DATE


3. REPORT TYPE AND DATES COVERED

FINAL/01  JUN  89  TO  31  DEC  92 


4.  TITLE  AND  SUBTITLE 

EFFICIENT  COMMUNICATION  FOR  PARALLEL  COMPUTING 


5. FUNDING NUMBERS


6. AUTHOR(S)


DR. BHATT

7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES)
YALE UNIVERSITY
1504A  YALE  STATION 
NEW  HAVEN,  CT  06511 


2304/A3
AFOSR-89-0382 


8. PERFORMING ORGANIZATION
REPORT  NUMBER 


9. SPONSORING/MONITORING AGENCY NAME(S) AND ADDRESS(ES)
AFOSR/NM 

110 DUNCAN AVE, SUITE B115
BOLLING  AFB  DC  20332-0001 


11. SUPPLEMENTARY NOTES




ELECTE 
MAR  05  1993 


10. SPONSORING/MONITORING
AGENCY  REPORT  NUMBER 

AFOSR-89-0382 


12a. DISTRIBUTION/AVAILABILITY STATEMENT


12b.  DISTRIBUTION  CODE 


APPROVED  FOR  PUBLIC  RELEASE:  DISTRIBUTION  IS  UNLIMITED 


13. ABSTRACT (Maximum 200 words)


The  main  accomplishments  during  this  grant  are  in  the  area  of  routing  algorithms 
for binary cube networks.


93-04658 


14.  SUBJECT  TERMS 


15. NUMBER OF PAGES


16.  PRICE  CODE 

17. SECURITY CLASSIFICATION OF REPORT    18. SECURITY CLASSIFICATION OF THIS PAGE    19. SECURITY CLASSIFICATION OF ABSTRACT    20. LIMITATION OF ABSTRACT


NSN 7540-01-280-5500




Standard Form 298 (Rev. 2-89)
298-102


DISCLAIMER NOTICE


THIS  DOCUMENT  IS  BEST 
QUALITY  AVAILABLE.  THE  COPY 
FURNISHED  TO  DTIC  CONTAINED 
A  SIGNIFICANT  NUMBER  OF 
PAGES  WHICH  DO  NOT 
REPRODUCE  LEGIBLY. 


S. LENNART JOHNSSON
January  1993 


1.  Summary 

The  main  accomplishments  during  this  grant  are  in  the  area  of  routing  algorithms  for  binary 
cube  networks. 

It is well known that meshes are subgraphs of binary cubes. Binary-reflected Gray codes
are often used to assign data elements to nodes in the cube in order to preserve adjacency.
It has been conjectured that the minimum number of element exchanges to perform the
conversion between the standard binary code and the binary-reflected Gray code is equal
to [...](n - 1), where M is the size of the data set in each node, and n the dimension of the
binary cube. We have shown this conjecture to be false, and provided an algorithm that
realizes the conversion in time [...] for n > 2. For n = 2 the conversion time has been known
to be [...] + 1. Our algorithm for n > 2 is quite simple.
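
As a minimal illustration of the two encodings involved (not the routing algorithm summarized above), the sketch below converts between standard binary code and binary-reflected Gray code and checks the adjacency property that makes Gray codes useful for embedding meshes. The function names are ours and are chosen only for illustration.

```python
def binary_to_gray(b: int) -> int:
    """Return the binary-reflected Gray code of the non-negative integer b."""
    return b ^ (b >> 1)


def gray_to_binary(g: int) -> int:
    """Invert binary_to_gray by XOR-ing together all right shifts of g."""
    b = 0
    while g:
        b ^= g
        g >>= 1
    return b


def hamming_distance(x: int, y: int) -> int:
    """Number of bit positions in which x and y differ."""
    return bin(x ^ y).count("1")


n = 3  # cube dimension, so there are 2**n nodes
codes = [binary_to_gray(i) for i in range(2 ** n)]

# Consecutive indices land on neighboring cube nodes (addresses differ in one bit).
assert all(hamming_distance(codes[i], codes[i + 1]) == 1 for i in range(2 ** n - 1))
# The conversion is a permutation of the node addresses and is invertible.
assert sorted(codes) == list(range(2 ** n))
assert all(gray_to_binary(binary_to_gray(i)) == i for i in range(2 ** n))
```

Because the conversion is a permutation of node addresses, changing the encoding of distributed data means moving every node's M elements to another node, which is the routing task whose cost is discussed here.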

In joint work with Alan Edelman of UC Berkeley and Steve Heller of Thinking Machines
Corp., we have also shown that the ability to construct pipelined algorithms for binary code
to  binary-reflected  Gray  code  conversion  is  related  to  the  ability  to  factor  without  pivoting  a 
matrix  defining  the  conversion  between  binary  code  and  binary-reflected  Gray.  The  matrix 
representation  provides  a  convenient  way  of  characterizing  algorithms  for  a  class  of  routing 
tasks. 
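
One way to see the matrix view is sketched below under our own conventions (bit vectors stored most significant bit first, arithmetic modulo 2). With these assumptions the Gray code bits are obtained from the binary bits by a unit lower bidiagonal matrix, and the inverse conversion by a lower triangular matrix of ones. This only illustrates the representation; it is not the pipelined routing algorithm of the joint work, and all names are hypothetical.

```python
def to_bits(x: int, n: int) -> list[int]:
    """Bits of x as a list of length n, most significant bit first."""
    return [(x >> (n - 1 - k)) & 1 for k in range(n)]


def from_bits(bits: list[int]) -> int:
    """Inverse of to_bits."""
    x = 0
    for bit in bits:
        x = (x << 1) | bit
    return x


def gray_matrix(n: int) -> list[list[int]]:
    """n x n matrix over GF(2): ones on the diagonal and first subdiagonal."""
    return [[1 if c in (r, r - 1) else 0 for c in range(n)] for r in range(n)]


def gray_inverse_matrix(n: int) -> list[list[int]]:
    """Inverse over GF(2): lower triangular matrix of all ones."""
    return [[1 if c <= r else 0 for c in range(n)] for r in range(n)]


def matvec_gf2(m: list[list[int]], v: list[int]) -> list[int]:
    """Matrix-vector product with arithmetic modulo 2."""
    return [sum(a * b for a, b in zip(row, v)) % 2 for row in m]


n = 4
G, B = gray_matrix(n), gray_inverse_matrix(n)
for x in range(2 ** n):
    g = from_bits(matvec_gf2(G, to_bits(x, n)))
    assert g == x ^ (x >> 1)                             # matches the usual formula
    assert from_bits(matvec_gf2(B, to_bits(g, n))) == x  # B undoes G
```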

Two communication algorithms devised earlier have been implemented on the Connection
Machine systems CM-2 and CM-200 in collaboration with Jean-Philippe Brunet of Thinking
Machines Corporation, and Ralph Brickner and Bill George of Los Alamos National
Laboratories. Both algorithms increase the effective use of the communications bandwidth
of the Connection Machine system. The all-to-all broadcast routine optimally yields a
performance enhancement by a factor of two over alternate ways of performing the operation.
In practice an improvement by up to a factor of ten has been observed. For the polyshift
operation an improvement by a factor of up to four has been observed.
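
For readers unfamiliar with the operation, the following sketch simulates all-to-all broadcast on a binary n-cube using a plain dimension-by-dimension exchange. It is a textbook-style baseline written for illustration and is an assumption on our part; it is not the routine implemented on the Connection Machine, whose details are not given in this report.

```python
def all_to_all_broadcast(n: int) -> list[set[int]]:
    """Simulate all-to-all broadcast on a binary n-cube by dimension exchange.

    Node i starts with the single item i. In step d every node swaps its
    whole current collection with its neighbor across cube dimension d.
    """
    nodes = 2 ** n
    held = [{i} for i in range(nodes)]
    for d in range(n):
        # Build the new state from the old one so all exchanges in a step are simultaneous.
        held = [held[i] | held[i ^ (1 << d)] for i in range(nodes)]
    return held


result = all_to_all_broadcast(3)
assert all(items == set(range(8)) for items in result)
```

After n steps every node holds all 2**n items. Note that this baseline uses only one of a node's n links in each step, so most of the cube's communication bandwidth is idle at any given time.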

In  collaboration  with  Kapil  Mathur  of  Thinking  Machines  Corporation  and  Zdenek  Johan 
of  Stanford  University  we  implemented  randomization  techniques  for  sparse  gather/scatter 
operations  on  the  Connection  Machine  systems  CM-2  and  CM-200.  The  gather/scatter 
utilities  improved  the  performance  by  up  to  a  factor  of  five.  For  large  unstructured  finite 
element grid codes the result is that no more than about half the time is spent in
communication. 

The  all-to-all  broadcast  routine  is  currently  being  extended  to  all-to-all  reduction.  During 
the fall we initiated two efforts towards the understanding of effective load-balancing for
dynamic, unstructured computations, with the goal being applications such as [...]
cell codes.

During the grant, results of the research have appeared in several journal and conference
publications,  and  been  disseminated  in  invited  lectures. 

2.  Journal  publications 

“Boolean  Cube  Emulation  of  Butterfly  Networks  Encoded  by  Gray  Code”  (with 
Ching-Tien  Ho),  to  appear  in  the  Journal  of  Parallel  and  Distributed  Computing. 

“POLYSHIFT  Communications  Software  for  the  Connection  Machine  System 
CM-200”,  (with  Ralph  Brickner  and  William  George),  to  appear  in  the  Journal 
of  Scientific  Programming. 

“An Efficient Algorithm for Gray-to-Binary Permutation on Hypercubes”, (with
Ching-Tien Ho and M.T. Raghunath), to appear in Journal of Parallel and Distributed
Computing.

“Minimizing the Communication Time for Matrix Multiplication on Multiprocessors”,
to appear in Journal of Parallel Computing.

“Optimal Communication Channel Utilization for Matrix Transposition and Related
Permutations on Boolean Cubes”, (with Ching-Tien Ho) to appear in the
Journal of Discrete Applied Mathematics.

“Block Cyclic Dense Linear Algebra”, (with Woody Lichtenstein), SIAM J. of Sci.
Comp.,  vol.  14,  no.  5,  1993. 

“Embedding  Hyper-pyramids  in  Hypercubes”,  (with  Ching-Tien  Ho),  to  appear 
in  IBM  Journal  of  Research  and  Development,  1992. 

“Communication Primitives for Unstructured Finite Element Simulations on
Data Parallel Architectures”, (with Kapil K. Mathur) Computing Systems in Engineering,
vol. 3, Nos 1-4, pp. 63 - 72, 1992. (Also presented at Symposium on
High Performance Computing for Flight Vehicles, December 7-9, 1992.)

“Local Basic Linear Algebra Subroutines (BLAS) on the Connection Machine
System CM-200”, (with Luis Ortiz), International Journal of Supercomputer Applications,
vol. 7, no. 1, 1993.

“All-to-All Broadcast with Applications on the Connection Machine”, (with
Jean-Philippe Brunet), International Journal of Supercomputer Applications, vol. 6,
no. 3, pp. 241-256, 1992.

“A  Data  Parallel  Finite  Element  Method  for  Computational  Fluid  Dynamics  on 
the  Connection  Machine  Systems”,  (with  Zdenek  Johan,  Tom  Hughes  and  Kapil 
K. Mathur), Computer Methods in Applied Mechanics and Engineering, vol. 99, no. 1,
pp.  113  -  134,  August  1992. 

“Communication  Efficient  Multi-Processor  FFT”,  (with  Michel  Jacquemin  and 
Robert  L.  Krawitz),  Journal  of  Computational  Physics,  vol.  102,  no.  2,  pp.  381-397, 
October  1992. 

“Cooley-Tukey  FFT  on  the  Connection  Machine”,  (with  Robert  L.  Krawitz), 
Journal  of  Parallel  Computing,  vol.  18,  no.  11,  pp.  1201-1221,  1992. 


“Generalized Shuffle Permutations on Boolean Cubes”, (with Ching-Tien Ho),
Journal of Parallel and Distributed Computing, vol. 16, no. 1, pp. 1-14, 1992.

“The  Parallel  Multipole  Method  on  the  Connection  Machine”,  (with  Feng  Zhao), 
SIAM  J.  Sci.  Stat.  Comp.,  vol.  12,  no.  6,  pp.  1420-1437,  November  1991. 

“Performance Modeling of Distributed Memory Architectures”, Journal of Distributed
and Parallel Computing, vol. 12, no. 4, pp. 300-312, 1991.

“QCD  on  the  Connection  Machine:  Beyond  *Lisp”,  (with  Ralph  G.  Brickner 
and  Clive  F.  Baillie),  Computer  Physics  Communications,  vol.  65,  pages  39-51,  1991. 

“A Data Parallel Implementation of an Explicit Method for the Compressible
Navier-Stokes Equations for Three-Dimensional Channel Flow”, (with Pelle Olsson)
Journal of Parallel Computing, vol. 14, no. 1, pp. 1-30, 1990.

“Optimizing Tridiagonal Solvers for the Alternating Direction Method on Boolean
Cube  Multiprocessors”,  (with  Ching-Tien  Ho),  SIAM  Journal  on  Sci.  Stat.  Comp., 
vol.  11,  no.  3,  pp.  563-592,  May  1990. 

“Data Structures and Algorithms for the Finite Element Method on a Data
Parallel Supercomputer”, (with Kapil Mathur), International Journal of Numerical
Methods in Engineering, vol. 29, no. 4, pp. 881-908, April 1990.

“Embedding Meshes in Boolean Cubes by Graph Decomposition”, (with Ching-Tien
Ho), the Journal of Parallel and Distributed Computing, vol. 8, no. 4, pp. 325-339,
April 1990.

“Experience with the Conjugate Gradient Method for Stress Analysis on a Data
Parallel Supercomputer”, (with Kapil K. Mathur) International Journal on Numerical
Methods in Engineering, vol. 27, no. 3, pp. 523-546, December 1989.

“Boundary Modifications of the Dissipation Operators for the Three-Dimensional
Euler Equations”, (with Pelle Olsson), Journal of Scientific Computing, vol. 4, no. 2,
pp.  159-195,  June,  1989. 

“The  Finite  Element  Method  on  a  Data  Parallel  Computing  System”,  (with 
Kapil Mathur), Int. Journal of High-Speed Computing, vol. 1, no. 1, pp. 29 - 44,
May  1989. 

“Histogram  Computation  on  Distributed  Memory  Architectures”  (with  Dimitris 
C.  Gerogiannis  and  Stelios  C.  Orphanoudakis),  Journal  on  Concurrency:  Practice  and 
Experience,  vol.  1,  no.  2,  pp.  219-237,  December  1989. 

“Spanning  Graphs  for  Optimum  Broadcasting  and  Personalized  Communication 
in  Hypercubes”,  (with  Ching-Tien  Ho),  IEEE  Trans.  Computers,  Vol.  38,  No.  9, 
pp.  1249-1268,  September,  1989. 

3.  Supported  Graduate  students 

Ted  Wesson,  Harvard  University,  part  time  in  1992. 

4.  Invited  presentations 


1992

“Massively Parallel Computing”, AFOSR Workshop on Computational Electromagnetics,
Hanscom Air Force Base, Mass., December 3 - 4, 1992.

“Efficient Massively Parallel Supercomputing”, The Fifth ECMWF Workshop on the
Use of Parallel Computers in Meteorology, Reading, England, November 23 - 27,
1992.

“Scalable Scientific Libraries”, First International Heinz Nixdorf Symposium on Parallel
Architectures and their Efficient Use, Paderborn, Germany, November 11 - 13, 1992.

“Techniques for High Performance Scientific Computing”, Second Symposium on
the  Frontiers  of  Massively  Parallel  Computation,  Fairfax,  Virginia,  October  10  -  12, 
1992. 

“Scalable Parallel Libraries and the Connection Machine system CM-5”, Rensselaer
Polytechnic Institute, September 10, 1992.

“Techniques for High Performance Scientific Computing”, Parallel Aspects of Numerical
Linear Algebra, Copenhagen, August 24 - 25, 1992.

“Scientific  Libraries  for  Scalable  Architectures”,  IBM  Workshop  on  Scientific 
Libraries  for  Parallel  Architectures,  Oberlech,  July  12  -  18,  1992. 

“Run-Time  system  support  for  distributed  memory  machine  compilers”,  Third 
Workshop  on  Compilers  for  Parallel  Computers,  Vienna,  July  6  -  8,  1992. 

“Computational Fluid Dynamics on Massively Parallel Architectures”, 2nd International
Conference on Spectral and High Order Methods, Montpellier, June 22 - 26,
1992.

“Scientific Libraries for Scalable Architectures”, 7th IMACS Conference on Computer
Methods for PDEs, New Brunswick, June 22 - 24, 1992.

“A  Data  Parallel  Finite  Element  Method  for  CFD  on  the  Connection  Machine 
Systems”, Parallel CFD '92, Implementation and Results Using Parallel Computers, Rutgers
University, New Brunswick, May 18 - 20, 1992.

“Massively Parallel Computing: Numerical and Computer Science issues”, The
ONR Workshop on Domain-Specific Parallelism: CS, NA, Physics, Los Angeles, CA,
May 14 - 15, 1992.

“Communication  primitives  and  their  implementation  for  distributed  memory 
architectures”, Standards for Message Passing in a Distributed Memory Environment,
Williamsburg, VA, April 29 - 30, 1992.

“Mathematical Software”, [...] Workshop on Systems Software and Tools for High
Performance  Computing  Environments,  Pasadena,  CA,  April  14  -  16,  1992. 

“Electronic  Parallel  Architectures”,  AFOSR  and  NSF  Workshop  on  Reconfigurable 
Free-Space  Optical  Interconnect,  Boulder,  CO,  March  11  -  13,  1992. 

“Massively  Parallel  Computing:  MIMD  vs.  Data  Parallel”,  Compcon  92,  San 
Francisco, February 24 - 28, 1992.

“Algorithms and Software Techniques for Scientific Applications on Scalable Architectures”,
Workshop on Parallel Computing for 3D Plasma Simulation, Albuquerque,
January 15 - 17, 1992.


1991 

“Scientific Libraries for Scalable Architectures”, The Danish Institute of Technology,
Copenhagen, December 19th, 1991.

“How can Models Promote Main Stream Parallelism”, Panel, The Third IEEE
Symposium on Parallel and Distributed Processing, Dallas, December 5, 1991.

“Scientific  Libraries  for  Scalable  Architectures”,  2nd  BLACS  Workshop,  Cornell 
University,  October  14-15,  1991. 

“Scientific Libraries for Scalable Architectures”, Distinguished Lecturer, the Minnesota
Supercomputer Institute, University of Minnesota, Minneapolis, October 2,
1991.

“Techniques for Efficient Data Motion in Large Scale Distributed Memory Systems”,
Workshop on Interconnection Networks, Marseille, July 15-19, 1991.

“Performance  Modeling  of  Distributed  Memory  Architectures”,  Workshop  on 
Conceptual  Models  of  Parallel  Scientific  Computation,  Seattle,  June  27-29,  1991. 

“Language and Compiler Issues in Building Scalable High Performance Scientific
Libraries”, NSF-NCRD Workshop on Advanced Compilation Techniques for Novel
Architectures, Kiryat-Anavim, Israel, May 27-30, 1991.

“Data Parallel Programming: Programming Primitives and Performance”, Tutorial
at the Symposium on Principles and Practices of Parallel Programming, PPoPP91,
Williamsburg, VA, April 21, 1991.

“Communication in Distributed Memory Architectures”, workshop on Basic Linear
Algebra Communication Subroutines, Houston, March 28, 1991.

“Making  Fortran  90  work  fast”,  workshop  on  DARPA/ISTO  Software  PI  Meeting, 
Warwick,  RI,  February  26-29,  1991. 

“Communication Primitives for Distributed Data Structures”, workshop on Global
Climate Modeling, the National Center for Atmospheric Research, Boulder,
January 9-10, 1991.


1990 

“Software Libraries for Scalable Supercomputers”, workshop on Reliable Large
Scale Scientific Computation, Rensselaer Polytechnic Institute, Troy, December 11-12,
1990.

“Software  Libraries  for  Data  Parallel  Languages”,  workshop  on  Parallel  Processors 
in Meteorology, Reading, England, November 26-30, 1990.

“The  Connection  Machine  Scientific  Software  Library”,  Los  Alamos  National 
Laboratories,  Los  Alamos,  November  16,  1990. 


“Scientific  Supercomputing  and  Computer  Science  in  the  1990’ies”,  Harvard 
University,  Cambridge,  November  8,  1990. 

“Scientific  Applications  on  Data  Parallel  Architectures:  Techniques  for  High 
Performance”,  Princeton  University,  Princeton,  November  5,  1990. 

“Communication  Libraries”,  Workshop  on  Scalable  Parallel  Libraries,  Oak  Ridge 
National  Laboratories,  Oak  Ridge,  September  6-7,  1990. 

“Obstacles to the Development of Parallel Libraries”, Workshop on Scalable Parallel
Libraries, Oak Ridge National Laboratories, Oak Ridge, September 6-7, 1990.

“Linear Algebra on Data Parallel Architectures”, The Northeast Parallel Architectures
Center  Summer  Institute,  Syracuse  University,  Syracuse,  July  25,  1990. 

“The Connection Machine Scientific Software Library”, The Northeast Parallel Architectures
Center Summer Institute, Syracuse University, Syracuse, July 25, 1990.

“Issues in the design of a Scientific Library”, Householder Symposium XI, Tylösand,
Sweden, June 18-22, 1990.

“Basic Array Operations”, Householder Symposium XI, Tylösand, Sweden, June 18-22,
1990.

“The Symmetric Eigenproblem”, Householder Symposium XI, Tylösand, Sweden,
June 18-22, 1990.

“Implementation on the Connection Machine - Experiences”, Householder Symposium
XI, Tylösand, Sweden, June 18-22, 1990.

“Teraflop computation: Distributed and Shared Memory”, Workshop on Acceleration
Algorithms, Boston University, April 13, 1990.

“Connection Machine Applications”, The Royal Institute of Technology,
January, 1990.


1989 


“Data Parallel Supercomputing”, Cornell University, Ithaca, New York, October
26, 1989.

“Data Parallel Supercomputing”, Argonne National Laboratories, Mathematical
and Computational Sciences Division, October 3, 1989.

“Graph  Embeddings  on  Hypercubes”,  University  of  Chicago,  Department  of 
Computer  Science,  October  2,  1989. 

“Graph Embeddings on Hypercubes”, NEC Research Institute, Princeton,
September 25, 1989.

“Data Parallel Supercomputing”, Amoco Research, Tulsa, September 21, 1989.

“Graph  Embeddings  on  Hypercubes”,  University  of  Tulsa,  Tulsa,  September  21, 
1989. 


“Fluent Supercomputing”, Second Swedish Workshop on Computer Systems
Architecture, Bålsta, Sweden, August 21-23, 1989.

“Graph Embeddings on Hypercubes”, Brown University, Providence, RI,
August 10, 1989.

“A  Linear  Algebra  Library  for  the  Connection  Machine”,  SIAM  Annual  Meeting, 
San Diego, CA, July 17-21, 1989.

“Data  Parallel  Supercomputing”,  Conference  on  Preconditioned  Conjugate  Gradient 
Methods, University of Nijmegen, Holland, June 19-21, 1989.

“High Performance Computing”, The Seventh ARMY Conference on Applied Mathematics
and Computing, West Point, New York, June 7, 1989.

“Scientific Applications on the Connection Machine”, RIACS, Moffett Field, CA,
May  10,  1989. 

“Data Parallel Supercomputing”, CERFACS, Toulouse, France, April 26, 1989.

“Scientific Applications on the Connection Machine”, The Royal Institute of Technology,
Stockholm, Sweden, February 9, 1989.

“Solving the Wide-Angle Wave Equation on a Data Parallel Computer”, Second
IMACS Symposium on Computational Acoustics, Princeton University, Princeton,
New Jersey, March 1989.


5.  Refereed  conference  papers  and  book  chapters 

“Massively  Parallel  Computing:  Mathematics  and  Communications  Libraries”, 
to  appear  in  Parallel  Computing  in  Meteorology,  World  Scientific,  1993. 

“Massively Parallel Computing: Data distribution and communication”, to appear
in Parallel Architectures and their Efficient Use, Springer Verlag, 1993.

“Language  and  Compiler  Issues  in  Scalable  High  Performance  Libraries”,  to 
appear in Compilation Techniques for Novel Architectures, Springer Verlag, 1993.

“Matrix Multiplication on Hypercubes Using Full Bandwidth and Constant Storage”,
(with Ching-Tien Ho), in Proceedings of the Sixth Distributed Memory Computing
Conference, pp. 447-451, IEEE Computer Society Press, April, 1991.

“Maximizing Channel Utilization for All-to-All Personalized Communication
on Boolean cubes”, (with Ching-Tien Ho) in Proceedings of the Sixth Distributed
Memory Computing Conference, pp. 299-304, IEEE Computer Society Press, April,
1991.

“Embedding Three-Dimensional Meshes in Boolean Cubes by Graph Decomposition”,
(with Ching-Tien Ho), in Proceedings of The 1990 International Conference
on Parallel Processing, pp. 319-326, IEEE Computer Society, August, 1990.

“Embedding Meshes into Small Boolean Cubes”, (with Ching-Tien Ho), in Proceedings
of The Fifth Distributed Memory Computing Conference, pp. 1366-1374, IEEE
Computer Society, April, 1990.

“The  Complexity  of  Reshaping  Arrays  on  Boolean  Cubes”,  (with  Ching-Tien 
Ho),  Proceedings  of  The  Fifth  Distributed  Memory  Computing  Conference,  pp.  370- 